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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: 

(A) \NAME: Studiengesellschaft Kohle mbH 

(B) VrREET: Kaiser-Wilhelm-Platz 1 

(C) CITY: Muelheim an der Ruhr 

(E) COUNTRY: Germany 

(F) POSTAL CODE (ZIP) : 45470 

(ii) TITLE OF INVENTION: A Process for the Preperation and 
Identification of Novel Hydrolases Having Improved 
Properties, 

(iii) NUMBER OF SEQU^JNCES: 21 

(iv) COMPUTER READABLE*. FORM : 

(A) MEDIUM TYPE:\Floppy disk 

(B) COMPUTER: IBM\PC compatible 

(C) OPERATING SYSTBM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentsJntf Release #1.0, 



Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO:flV 



(i) SEQUENCE CHARACTERISTICS^ 

(A) LENGTH: 30 base pairs> 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown ' 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic 
(A) DESCRIPTION: /desc 



cid 



"synthetic DNA" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
GCGCAATTAA CCCTCACTAA AGGGAACAAA 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GCGTAATACG ACTCACTATA GGGCGAA 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANfi(EDNESS : unknown 

(D) TOPOLOcSY: unknown 



(ii) MOLECULE TYPE\: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: \CDS 

(B) LOCATION: 



(ix) FEATURE: 

(A) NAME/KEY: 



iViat_peptide 



(B) LOCATION: 1^3.71017 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GGATCCCCCG GTTCTCCCGG AAGgVlTTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 
CCATCAACCT GAG AT GAG AA CAAC 



AAG AAG AAG TAT CTG CTC CCC CTC 
Lys Lys Lys Tyr Leu Leu Pro Leu 
£6 -25 -20 

CTC GCT GCC AGC CCT CTG ATC CAG 
Leu Ala Ala Ser Pro Leu lie Gin 
-5 



GGC CTG GCC ATC GGT CTC GCC 

Gly Leu Ala lie Gly Leu Ala | 
-15 

GCC AGC ACC TAC ACC CAG ACC AAA\ TAC CCC ATC GTG CTG GCC CAC GGC 

Ala Ser Thr Tyr Thr Gin Thr Lys\Tyr Pro He Val Leu Ala His Gly 

1 5 \ 10 

ATG CTC GGC TTC GAC AAC ATC CTC GGG GTC GAC TAC TGG TTC GGC ATT 
Met Leu Gly Phe Asp Asn He Leu G\y Val Asp Tyr Trp Phe Gly He 
20 \ 25 30 

CCC AGC GCC TTG CGC CGT GAC GGT GCC\CAG GTC TAC GTC ACC GAA GTC 
Pro Ser Ala Leu Arg Arg Asp Gly Ala\Gln Val Tyr Val Thr Glu Val 
35 40 \ 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC g\c GAG CAG TTG CTG CAA CAG 

Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
50 

GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG \CCC AAG GTC AAC CTG ATC 

Val Glu Glu He Val Ala Leu Ser Gly Gin Pro Lys Val Asn Leu He 
65 70 \ 75 

GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAC GTC GCC GCC GTA CGT 
Gly His Ser His Gly Gly Pro Thr He Arg TyV Val Ala Ala Val Arg 
80 85 9Q 95 

CCC GAC CTG ATC GCT TCC GCC ATC AGC GTC GGC VSCC CCG CAC AAG GGT 
Pro Asp Leu He Ala Ser Ala He Ser Val Gly Via Pro His Lys Gly 
100 105 \ HO 

TCG GAC ACC GCC GAC TTC CTG CGC CAG ATC CCA C&G GGT TCG GCC GGC 
Ser Asp Thr Ala Asp Phe Leu Arg Gin He Pro Prfo Gly Ser Ala Gly 
115 120 \ 125 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC CTC GGC >GCG CTG ATC AGC 
Glu Ala Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu He Ser 
130 135 lV0 



60 
111 
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TTC CTT TCC AGC GGC\GGC ACC GGT ACG 
Phe Leu Ser Ser Gly ^Gly Thr Gly Thr 
145 \ 150 



CAG AAT 
Gin Asn 



TCA CTG 
Ser Leu 
155 



GGC TCG 
Gly Ser 



CTG 

Leu 



GAG TCG CTG AAC AGC GAG GGT GCC GCG 
Glu Ser Leu Asn Ser Gl\ Gly Ala Ala 
160 165 > 



CGC TTC 
Arg Phe 
170 



AAC GCC 
Asn Ala 



AAG TAC 
Lys Tyr 



CCG 
Pro 
175 



CAG GGC ATC CCC ACC TCG &CC TGC GGC 
Gin Gly lie Pro Thr Ser m.a Cys Gly 
180 



GAA GGC 
Glu Gly 
185 



GCC TAC 
Ala Tyr 



AAG GTC 
Lys Val 
190 



AAC 
Asn 



GGC GTG AGC TAT TAC TCC TGK AGC GGT 
Gly Val Ser Tyr Tyr Ser Trp\ Ser Gly 
195 \ 200 



TCC TCG 
Ser Ser 



CCG CTG 
Pro Leu 



ACC AAC 
Thr Asn 
205 



TTC 
Phe 



CTC GAT CCG AGC GAC GCC TTC STC GGC 
Leu Asp Pro Ser Asp Ala Phe Leu Gly 
210 2 



GCC TCG 
Ala Ser 



TCG CTG 
Ser Leu 
220 



ACC TTC 
Thr Phe 



AAG 
Lys 



AAC GGC ACC GCC AAC GAC GGC CWGTC 
Asn Gly Thr Ala Asn Asp Gly LeuAVal 
225 230 



GGC ACC 
Gly Thr 



TGC AGT 
Cys Ser 
235 



TCG CAC 
Ser His 



CTG 
Leu 



GGC ATG GTG ATC CGC GAC AAC TAC uGG 
Gly Met Val lie Arg Asp Asn Tyr Arg 
240 245 



ATG AAC 
Met Asn 
250 



CAC CTG 
His Leu 



GAC GAG 
Asp Glu 



GTG 
Val 
255 



AAC CAG GTC TTC GGC CTC ACC AGC CTG 
Asn Gin Val Phe Gly Leu Thr Ser Leu\ 
260 



TTC GAG 
Phe Glu 
65 



ACC AGC 
Thr Ser 



CCG GTC , 
Pro Val 
270 



GTC TAC CGC CAG CAC GCC AAC CGC CTG . 
Val Tyr Arg Gin His Ala Asn Arg Leu 
275 280 



GCC AGC 
. Ala Ser 



CTG 
Leu 
285 



TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CC 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 S 

Met Lys Lys Lys Tyr Leu Leu Pro Leu Gly Leu AUa lie Gly Leu Ala 
-26 -25 -20 -i5 

Ser Leu Ala Ala Ser Pro Leu lie Gin Ala Ser Thi Tyr Thr Gin Thr 

-10 -5 1 

Lys Tyr Pro lie Val Leu Ala His Gly Met Leu Gly \he Asp Asn lie 
10 15 \ 20 



Leu Gly Val Asp Tyr Trp Phe Gly lie Pro Ser Ala L&u Arg Arg Asp 
25 30 
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Gly Ala Gin Val Tyr Vil Thr Glu Val Ser 
40 \ 45 



Gin Leu Asp Thr 
50 



Val Arg Gly Glu Gin Leu\ Leu Gin Gin Val Glu Glu He Val 
55 60' 



Ala Leu 
70 



Ser Gly Gin Pro Lys Val Isn Leu He Gly His Ser His Gly 
75 



Gly Pro 
85 



Thr He Arg Tyr Val Ala Ala Val Arg Pro 
90 \ 95 



Asp Leu He Ala 
100 



He Ser Val Gly Ala Pro Hit Lys Gly Ser 
105 \ 110 



Asp Thr Ala Asp 
115 



Arg Gin He Pro Pro Gly SerVAla Gly Glu 
120 125 



Ala Val Leu Ser Gly Leu 
130 



Val Asn Ser Leu Gly Ala Leu Me Ser Phe 
135 140 



Leu Ser Ser Gly 
145 



Gly Thr 
150 



Gly Thr Gin Asn Ser Leu Gly SeM Leu Glu Ser Leu Asn Ser 
155 ft) 160 



Glu Gly 
165 



Ala Ala Arg Phe Asn Ala Lys Tyr Pro Gin 
170 



Cys Gly Glu Gly Ala Tyr Lys Val Afen Gly 
185 190 



Ser Gly Ser Ser Pro Leu Thr Asn Phd Leu 
200 205 



Gly He Pro Thr 
180 



Val Ser Tyr Tyr 
195 



Asp Pro Ser Asp 
210 



Ser Ala 
Ser Trp 
Ala Phe 



Leu Gly Ala Ser Ser Leu Thr Phe Lys ft sn 
215 220 



Gly Thr Ala Asn 
225 



Asp Gly 
230 



Leu Val Gly Thr Cys Ser Ser His Leu Gl 
235 



Met Val He Arg 



Asp Asn 
245 



Tyr Arg Met Asn His Leu Asp Glu Val Asn) 
250 255 



Gin Val Phe Gly 
260 



Ser Leu Phe Glu Thr Ser Pro Val Ser Val 
265 270 



yr Arg Gin His 
275 



Arg Leu Lys Asn Ala Ser Leu 
280 285 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 85. .1017 



(ix) FEATURE: 

(A) NAME/fcEY: mat_peptide 

(B) LOCATION: 163. .1017 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5: 
GGATCCCCCG GTTCTCCCGG VagGATTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 



CCATCAACCT GAGATGAGAA ' 



AC ATG AAG AAG AAG TCT CTG CTC CCC CTC 
Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 -20 



60 
111 



GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 
Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu lie Gin 
-15 \ -10 -5 

GCC AGC ACC TAC ACC CAG AQC AAA TAC CCC ATC GTG CTG GCC CAC GGC 
Ala Ser Thr Tyr Thr Gin ThY Lys Tyr Pro lie Val Leu Ala His Gly 
1 5 

ATG CTC GGC TTC GAC AAC ATCKTC GGG GTC GAC TAC TGG TTC GGC ATT 
Met Leu Gly Phe Asp Asn Ilef%»eu Gly Val Asp Tyr Trp Phe Gly lie 
20 LV 25 30 

CCC AGC GCC TTG CGC CGT GAC GST GCC CAG GTC TAC GTC ACC GAA GTC 
Pro Ser Ala Leu Arg Arg Asp GlV Ala Gin Val Tyr Val Thr Glu Val 
35 \ 40 45 

AGC CAG TTG GAC ACC TCG GAA GTC\CGC GGC GAG CAG TTG CTG CAA CAG 
Ser Gin Leu Asp Thr Ser Glu Val y^rg Gly Glu Gin Leu Leu Gin Gin 
50 55 \ 60 

GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG CCC AAG GTC AAC CTG ATC 
Val Glu Glu lie Val Ala Leu Ser GAy Gin Pro Lys Val Asn Leu lie 
65 70 \ 75 

GGC CAC AGC CAC GGC GGG CCG ACC ATCXCGC TAC GTC GCC GCC GTA CGT 
Gly His Ser His Gly Gly Pro Thr IleWg Tyr Val Ala Ala Val Arg 

90 95 

CCC GAC CTG ATC GCT TCC GCC ACC AGC G^C GGC GCC CCG CAC AAG GGT 
Pro Asp Leu lie Ala Ser Ala Thr Ser vAl Gly Ala Pro His Lys Gly 
100 10% 110 

TCG GAC ACC GCC GAC TTC CTG CGC CAG ATQ CCA CCG GGT TCG GCC GGC 
Ser Asp Thr Ala Asp Phe Leu Arg Gin Ile\Pro Pro Gly Ser Ala Gly 
115 120 \ 125 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC «TC GGC GCG CTG ATC AGC 
Glu Ala Val Leu Ser Gly Leu Val Asn Ser Beu Gly Ala Leu lie Ser 
130 135 \ 140 

TTC CTT TCC AGC GGC GGC ACC GGT ACG CAG AAtT TCA CTG GGC TCG CTG 
Phe Leu Ser Ser Gly Gly Thr Gly Thr Gin Asa Ser Leu Gly Ser Leu 
145 150 \155 

GAG TCG CTG AAC AGC GAG GGT GCC GCG CGC TTC W.C GCC AAG TAC CCG 
Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala Lys Tyr Pro 
160 165 170 \ 175 



- 42 - 



CAG GGC ATC CCC ACC YTCG GCC TGC GGC GAA GGC GCC TAC AAG GTC AAC 
Gin Gly lie Pro Thr Eer Ala Cys Gly Glu Gly Ala Tyr Lys Val Asn 
180 \ 185 190 

GGC GTG AGC TAT TAC T&C TGG AGC GGT TCC TCG CCG CTG ACC AAC TTC 
Gly Val Ser Tyr Tyr SeV Trp Ser Gly Ser Ser Pro Leu Thr Asn Phe 
195 \ 200 205 



CTC GAT CCG AGC GAC GCCXTTC CTC GGC 
Leu Asp Pro Ser Asp Ala whe Leu Gly 
210 \ 215 



TCG 
Ser 



TCG CTG 
Ser Leu 
220 



ACC TTC AAG 
Thr Phe Lys 



AAC GGC ACC GCC AAC GAC GSC CTG GTC GGC ACC 
Asn Gly Thr Ala Asn Asp Gly Leu Val Gly Thr 
225 

GGC ATG GTG ATC CGC GAC AAC YrAC CGG ATG AAC 
Gly Met Val lie Arg Asp Asn Tyr Arg Met Asn 
240 245 \ 250 

AAC CAG GTC TTC GGC CTC ACC AGjDCTG TTC GAG 
Asn Gin Val Phe Gly Leu Thr SeK Leu Phe Glu 
260 \ 265 

GTC TAC CGC CAG CAC GCC AAC CGC QLTG AAG AAC 
Val Tyr Arg Gin His Ala Asn Arg Lfeu Lys Asn 
275 2f- 



TGC AGT 
Cys Ser 
235 

CAC CTG 
His Leu 



ACC AGC 
Thr Ser 



GCC AGC 
Ala Ser 



TCG CAC CTG 
Ser His Leu 



GAC GAG GTG 
Asp Glu Val 
255 

CCG GTC AGC 
Pro Val Ser 
270 



CTG 
Leu 
285 



TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CO 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

Met Lys Lys Lys Ser Leu Leu Pro Leu Gly 
-26 -25 -20 



i Ala He Gly Leu Ala 
-15 



Ser Leu Ala Ala Ser Pro Leu He Gin Ala Ser 
-10 -5 1 



thr Tyr Thr Gin Thr 
5 



Lys Tyr Pro He Val Leu Ala His Gly Met Leu Gly Phe Asp Asn He 



Leu Gly Val Asp Tyr Trp Phe Gly He Pro Ser 
25 30 



Ala Leu Arg Arg Asp 
35 



Gly Ala Gin Val Tyr Val Thr Glu Val Ser Gin Leu\Asp Thr Ser Glu 
40 45 50 \ 

Glu r\Le Val Ala Leu 



Val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu 
55 60 65 



70 



Ser Gly Gin Pro Lys Val Asn Leu He Gly His 
75 80 



Ser His Gly Gly Pro 
85 



Thr lie Arg Tyr Val AM Ala Val Arg Pro Asp Leu He Ala Ser Ala 
90 \ 95 100 

Thr Ser Val Gly Ala Pro\His Lys Gly Ser Asp Thr Ala Asp Phe Leu 
105 \ 110 115 

Arg Gin He Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 
120 ]\25 130 

Val Asn Ser Leu Gly Ala h\u He Ser Phe Leu Ser Ser Gly Gly Thr 
135 140 \ 145 150 

Gly Thr Gin Asn Ser Leu GlA Ser Leu Glu Ser Leu Asn Ser Glu Gly 
155 \ 160 165 

Ala Ala Arg Phe Asn Ala Lys Vyr Pro Gin Gly He Pro Thr Ser Ala 
170 \ 175 180 

Cys Gly Glu Gly Ala Tyr Lys VM. Asn Gly Val Ser Tyr Tyr Ser Trp 
185 :ppp 195 

Ser Gly Ser Ser Pro Leu Thr Asn\ Phe Leu Asp Pro Ser Asp Ala Phe 
200 205 \ 210 

Leu Gly Ala Ser Ser Leu Thr Phe lys Asn Gly Thr Ala Asn Asp Gly 
215 220 \ 225 230 

Leu Val Gly Thr Cys Ser Ser His Le\u Gly Met Val He Arg Asp Asn 
235 \ 240 245 

Tyr Arg Met Asn His Leu Asp Glu Val\Asn Gin Val Phe Gly Leu Thr 
250 255\ 260 

Ser Leu Phe Glu Thr Ser Pro Val Ser \al Tyr Arg Gin His Ala Asn 
265 270 \ 275 

Arg Leu Lys Asn Ala Ser Leu 
280 285 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 85. . 1017 

(ix) FEATURE: 

(A) NAME / KEY : mat_peptide 

(B) LOCATION: 163. . 1017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 60 
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CCATCAACCT GAGATGAGAl 



GGC CTG 
Gly Leu 



GCC AGC 
Ala Ser 



ATG CTC 
Met Leu 



CCC AGC 
Pro Ser 



GCC ATC GGT 
Ala He Gly 



ACC TAC ACC 
Thr Tyr Thr 



GGC TTC GAC 
Gly Phe Asp 
20 

GCC TTG CGC 
Ala Leu Arg 



CAAC ATG AAG AAG AAG TCT CTG CTC CCC CTC 
Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 -20 

CT& GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 
Leu\Ala Ser Leu Ala Ala Ser Pro Leu He Gin 
-5 

CAG AtC AAA TAC CCC ATC GTG CTG GCC CAC GGC 
Gin Tfcr Lys Tyr Pro He Val Leu Ala His Gly 
5 \ 10 15 

AAC ATd CTT GGG GTC GAC TAC TGG TTC GGC ATT 
Asn Ile\Leu Gly Val Asp Tyr Trp Phe Gly He 
25 30 



CGT GAC i 
Arg Asp i 



GCC CAG GTC TAC GTC ACC GAA GTC 
I Ala Gin Val Tyr Val Thr Glu Val 
40 45 



AGC CAG 
Ser Gin 



TTG GAC ACC 
Leu Asp Thr 
50 



TCG GAA G^CGC GGC GAG CAG TTG CTG CAA CAG 
Ser Glu Val\Arg Gly Glu Gin Leu Leu Gin Gin 
55\ 60 



GTG GAG 
Val Glu 
65 

GGC CAC 
Gly His 
80 



GAA ATC GTC 
Glu He Val 



AGC CAC GGC 
Ser His Gly 



GCC CTC AGC ■ 
Ala Leu Ser ■ 



CAG CCC AAG GTC AAC CTG ATC 
ly Gin Pro Lys Val Asn Leu He 
75 



GGG CCG ACC ATC CGC TAC GTC GCC GCC GTA CGT 
Gly Pro Thr Il\ Arg Tyr Val Ala Ala Val Arg 
90 95 



CCC GAC 
Pro Asp 



TCG GAC 
Ser Asp 



GAG GCA 
Glu Ala 



CTG ATC GCT TCC GCC ACC AGC TGTC GGC GCC CCG CAC AAG GGT 
Leu He Ala Ser Ala Thr Ser Val Gly Ala Pro His Lys Gly 
100 l\p5 110 

ACC GCC GAC TTC CTG CGC CAG ATC CCA CCG GGT TCG GCC GGC 
Thr Ala Asp Phe Leu Arg Gin II* Pro Pro Gly Ser Ala Gly 
115 120 \ 125 

GTC CTC TCC GGG CTG GTC AAC AGC \CTC GGC GCG CTG ATC AGC 
Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu He Ser 
130 135 \ 140 



TTC CTT 
Phe Leu 
145 



TCC AGC GGC 
Ser Ser Gly 



GGC ACC GGT ACG CAG . 
Gly Thr Gly Thr Gin . 
150 



&T TTA CTG GGC TCG CTG 
3n Leu Leu Gly Ser Leu 
155 



GAG TCG 
Glu Ser 
160 

CAG GGC 
Gin Gly 



GGC GTG 
Gly Val 



CTG AAC AGC 
Leu Asn Ser 



ATC CCC ACC 
He Pro Thr 
180 

AGC TAT TAC 
Ser Tyr Tyr 
195 



GAG GGT GCC GCG CGC TTQ AAC GCC AAG TAC CCG 
Glu Gly Ala Ala Arg PhelAsn Ala Lys Tyr Pro 
170\ 175 



TCG GCC TGC GGC GAA GGC ' 
Ser Ala Cys Gly Glu Gly . 

185 



TAC AAG GTC AAC 
Tyr Lys Val Asn 
190 



TCC TGG AGC GGT TCC TCG CQG CTG ACC AAC TTC 
Ser Trp Ser Gly Ser Ser Prb Leu Thr Asn Phe 
200 \ 205 
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CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC TCG TCG CTG ACC TTC AAG 
Leu Asp Pro Ser As\p Ala Phe Leu Gly Ala Ser Ser Leu Thr Phe Lys 
210 \ 215 220 

AAC GGC ACC GCC McW GGC CTG GTC GGC ACC TGC AGT TCG CAC CTG 
Asn Gly Thr Ala Asn \sp Gly Leu Val Gly Thr Cys Ser Ser His Leu 
225 \ 230 235 

GGC ATG GTG ATC CGC GAC AAC TAC CGG ATG AAC CAC CTG GAC GAG GTG 
Gly Met Val lie Arg Asto Asn Tyr Arg Met Asn His Leu Asp Glu Val 
240 2A\ 250 255 

AAC CAG GTC TTC GGC CTC\ ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 
Asn Gin Val Phe Gly Leu\Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 
260 \ 265 270 

GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 
Val Tyr Arg Gin His Ala Sen Arg Leu Lys Asn Ala Ser Leu 
275 \/} 280 285 

TAGGACCCCG GCCGGGGCCT CGGCl 



(2) INFORMATION FOR SEQ ID N®: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 aminb acids 

(B) TYPE: amino acid^ 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein ' 
(xi) SEQUENCE DESCRIPTION: SE\Q ID NO: 

Met Lys Lys Lys Ser Leu Leu Pro Lau Gly Leu Ala lie Gly Leu Ala 
-26 -25 -20 \ -15 

Ser Leu Ala Ala Ser Pro Leu lie Gln\Ala Ser Thr Tyr Thr Gin Thr 
-10 -5 

Lys Tyr Pro lie Val Leu Ala His Gly Met Leu Gly Phe Asp Asn lie 
10 15 \ 20 

Leu Gly Val Asp Tyr Trp Phe Gly lie Prfo Ser Ala Leu Arg Arg Asp 
25 30 \ 35 

Gly Ala Gin Val Tyr Val Thr Glu Val Ser\Gln Leu Asp Thr Ser Glu 
40 45 \ 50 

Val Arg Gly Glu Gin Leu Leu Gin Gin Val dlu Glu lie Val Ala Leu 
60 165 70 

Ser Gly Gin Pro Lys Val Asn Leu lie Gly Hj\s Ser His Gly Gly Pro 

i 85 

Thr lie Arg Tyr Val Ala Ala Val Arg Pro Asp\ Leu lie Ala Ser Ala 
90 95 \ 100 

Thr Ser Val Gly Ala Pro His Lys Gly Ser Asp thr Ala Asp Phe Leu 
105 110 \ 115 



Arg Gin lie Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 
120 125 13 
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Val Asn Ser 
135 



Gly Gly Thr 
150 



Ser Glu Gly 
165 



Leu Gly Ala 
215 



Leu Val Gly 
Tyr Arg Met 



Ser Leu Phe 
265 



Arg Leu Lys 
280 




(2) INFORMATION FOR SEQ ID NO: 9 

(i) SEQUENCE CHARACTERISTICS \ 

(A) LENGTH: 1047 base paVrs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown\ 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 84. .1016 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 162. .1016 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:\9: 
GGATCCCCGG TTCTCCCGGA AGGATTCGGG CGATGGCTGIf CAGGACGCGC CCCTCGGCCC 60 



CATCAACCTG AGATGAGAAC AAC ATG AAG AAG AAG 

Met Lys Lys Lys S^ 



GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC 
Gly Leu Ala He Gly Leu Ala Ser Leu Ala Ala 
-15 -10 



T CTG CTC CCC CTC 110 
t r Leu Leu Pro Leu 
-20 

IC CCT CTG ATC CAG 158 
S&r Pro Leu He Gin 
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GCC AGC 
Ala Ser 



ACC TAC\ 
Thr Tyr 



ACC CAG ACC 
'hr Gin Thr 
5 



AAA TAC 
Lys Tyr 



CCC ATC GTG CTG 
Pro lie Val Leu 



GCC CAC 
Ala His 



GGC 
Gly 



ATG CTC 
Met Leu 



GGC TTC < 
Gly Phe . 



CTC GGG 
Leu Gly 



GTC GAC TAC TGG 
Val Asp Tyr Trp 
25 



TTC GGC 
Phe Gly 



ATT 

He . 



CCC' AGC 
Pro Ser 



GCC TTG 
Ala Leu 
35 



CGC i 
Arg AA 



GAC 
Asp 



GGT GCC 
Gly Ala 



CAG GTC TAC GTC 
Gin Val Tyr Val 



ACC GAA 
Thr Glu 
45 



GTC 
Val 



AGC CAG 
Ser Gin 



TTG GAC , 
Leu Asp 1 



. GTC CGC 
Val Arg 



GGC GAG CAG TTG 
Gly Glu Gin Leu 



CTG CAA 
Leu Gin 



CAG 
Gin 



GTG GAG 
Val Glu 
65 



GAA ATC 
Glu He 



GTC GCC CTC 
Val Ala Leu\ 
70 



AGC GGC 
Ser Gly 



CAG CCC AAG GTC 
Gin Pro Lys Val 



AAC CTG ATC 
Asn Leu He 



GGC CAC 
Gly His 
80 



AGC CAC 
Ser His 



GGC GGG CCG . 
Gly Gly Pro 
85 



4 ATC 

He 



CGC TAC GTC GCC 
Arg Tyr Val Ala 
90 



GCC GTA CGT 
Ala Val Arg 
95 



CCC GAC 
Pro Asp 



CTG ATC 
Leu He 



GCT TCC GCC . 
Ala Ser Ala 
100 



GTC GGC GCC CCG 
Val Gly Ala Pro 
105 



CAC AAG GGT 
His Lys Gly 
110 



TCG GAC 
Ser Asp 



ACC GCC 
Thr Ala 
115 



GAC TTC CTG 
Asp Phe Leu 



CGC CA\ 

Arg Gli 
120 1 



ATC CCA CCG GGT 
:ie Pro Pro Gly 



TCG GCC GGC 
Ser Ala Gly 
125 



GAG GCA 
Glu Ala 



GTC CTC 
Val Leu 
130 



TCC GGG CTG 
Ser Gly Leu 



GTC AAC 
Val Asn 
135 



\GC CTC 
Ster Leu 



GGC GCG 
Gly Ala 
140 



CTG ATC AGC 
Leu He Ser 



TTC CTT 
Phe Leu 
145 



TCC AGC 
Ser Ser 



GGC AGC 
Gly Ser 



GGT ACG 
Gly Thr 



CAG\ AAT 
Gln\Asn 



TCA CTG 
Ser Leu 
155 



GGC TCG CTG 
Gly Ser Leu 



GAG TCG 
Glu Ser 
160 



CTG AAC 
Leu Asn 



AGC GAG 
Ser Glu 
165 



GGT 
Gly 



GCC GCG 
Ala Ala 



CGC TTC 
Arg Pile 



AAC GCC 
Asn Ala 



AAG 
Lys 



TAC CCG 
Tyr Pro 
175 



CAG GGC 
Gin Gly 



ATC CCC 
He Pro 



ACC TCG 
Thr Ser 
180 



GCC 
Ala 



TGC GGC 
Cys Gly 



GAA GGC 
Glu Gly' 
185 



GCC TAC 
la Tyr 



GTC AAC 
Val Asn 
190 



GGC GTG 
Gly Val 



AGC TAT 
Ser Tyr 
195 



TAC TCC 
Tyr Ser 



AGC GGT 
Ser Gly 
200 



TCC TCG 
Ser Ser 



CCG CTG 
PAo Leu 



ACC 
Thr 
205 



AAC TTC 
Asn Phe 



CTC GAT 
Leu Asp 



CCG AGC 
Pro Ser 
210 



GAC GCC TTC 
Asp Ala Phe 



CTC GGC ' 
Leu Gly . 
215 



ACC 
Thr 



TTC AAG 
Phe Lys 



AAC GGC 
Asn Gly 
225 



ACC GCC 
Thr Ala 



AAC GAC GGC 
Asn Asp Gly 
230 



CTG GTC 
Leu Val 



GGC ACC TGC AST 
Gly Thr Cys Se\ 
235 



TCG CAC CTG 
Ser His Leu 



GGC ATG GTG fflC CGC GAC AAC TAC CGG ATG AAC CAC CTG GAC GAG GTG 
Gly Met Val l\e Arg Asp Asn Tyr Arg Met Asn His Leu Asp Glu Val 
?45 250 255 

AAC CAG GTC TTC^GC CTC ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 
sn Gin Val Phe Sly Leu Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 
2>60 265 270 

GTC TAC CGC CAG CAQ GCC AAC CGC CTG AAG AAC GCC AGC CTG 
Val Tyr Arg Gin HisYAla Asn Arg Leu Lys Asn Ala Ser Leu 
275 \ 280 285 

TAGGACCCCG GCCGGGGCCT\CGGCCCGGGC C 



(2) INFORMATION FOR SEQXID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31l\amino acids 

(B) TYPE: amino Vcid 
(D) TOPOLOGY: liri\ 



(ii) MOLECULE TYPE: 
(xi) SEQUENCE DESCRIPTld 



SEQ ID NO: 10: 



Met Lys Lys Lys Ser Leu Leu Pr® Leu Gly Leu Ala lie Gly Leu Ala 
-26 -25 -20 \ -15 

Ser Leu Ala Ala Ser Pro Leu lie teln Ala Ser Thr Tyr Thr Gin Thr 

1 5 



Lys Tyr Pro lie Val Leu Ala His G1V Met Leu Gly Phe Asp Asn lie 

10 15\ 20 

Leu Gly Val Asp Tyr Trp Phe Gly lie Sro Ser Ala Leu Arg Arg Asp 

25 30 \ 35 

Gly Ala Gin Val Tyr Val Thr Glu Val Selt Gin Leu Asp Thr Ser Glu 



40 



50 



Val Arg Gly Glu Gin Leu Leu Gin Gin Val <51u Glu lie Val Ala Leu 

55 60 \5 70 

Ser Gly Gin Pro Lys Val Asn Leu lie Gly His Ser His Gly Gly Pro 

75 80 



Thr lie Arg Tyr Val Ala Ala Val Arg Pro Asp YLeu lie Ala Ser Ala 

90 95 \ 100 

Thr Ser Val Gly Ala Pro His Lys Gly Ser Asp Tftr Ala Asp Phe Leu 

105 110 \ 115 

Arg Gin lie Pro Pro Gly Ser Ala Gly Glu Ala Val\Leu Ser Gly Leu 

120 125 130\ 

Val Asn Ser Leu Gly Ala Leu lie Ser Phe Leu Ser s\er Gly Ser Thr 

135 140 145 \ 150 

Gly Thr Gin Asn Ser Leu Gly Ser Leu Glu Ser Leu Asn\Ser Glu Gly 

155 160 \ 165 
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Ala Ala 
Cys Gly 



Arg Phe Jsn Ala Lys Tyr 
170 V 



Pro Gin ' 
175 



Ser Gly 
200 



Leu Gly 
215 



Arg Leu 
280 



Glu Gly Alk Tyr 
185 

Ser Ser Pro leu 

Ala Ser Ser Le 
22fc 

Gly Thr Cys Ser\ 
235 

Met Asn His Leu 
250 

Phe Glu Thr Ser 
265 

Lys Asn Ala Ser 



Lys Val 
190 



Thr Asn 
205 



Gly He 
Asn Gly Val Ser 



Pro Thr 
180 



Tyr Tyr 
195 



Asp Pro 
210 



Thr Phe Lys Asn Gly Thr 
225 



Leu Gly 
240 



Ser Asp 
Ala Asn 
Met Val He Arg 



Val Asn < 
255 



Gin Val 

Pro\Val Ser Val Tyr Arg 



Phe Gly 
260 



Gin His 
275 



Ser Ala 
Ser Trp 
Ala Phe 



Asp Gly 
230 



Asp Asn 
245 



Leu Thr 
Ala Asn 



Leu 
285 



(2) INFORMATION FOR SEQ ID NO: l\ 

(i) SEQUENCE CHARACTERISTICS \ 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown\ 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA ( genomic )\ 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 85. . 1017 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 163. .1017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: lV 

GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCTG yCAGGACGCG CCCCTCGGCC 

CCATCAACCT GAGATGAGAA CAAC ATG AAG AAG AAG TcV CTG CTC CCC CTC 
Met Lys Lys Lys Sei Leu Leu Pro Leu 
-26 -25 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGc\ CCT CTG ATC CAG 
Gly Leu Ala He Gly Leu Ala Ser Leu Ala Ala SerXPro Leu He Gin 
-15 -10 

GCC AGC ACC TAC ACC CAG ACC AAA TAC CCC ATC GTG C^G GCC CAC GGC 
Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro He Val L\u Ala His Gly 
1 5 10 

ATG CTC GGC TTC GAC AAC ATC CTT GGG GTC GAC TAC TGg\tTC GGC ATT 
Met Leu Gly Phe Asp Asn He Leu Gly Val Asp Tyr Trp\Phe Gly He 
20 — * ' 
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CCC AGC GCC 
Pro Ser Ala 



TTG SGC 
Leu A\g 

35 



CGT GAC 
Arg Asp 



GGT GCC CAG 
Gly Ala Gin 



GTC TAC 
Val Tyr 



GTC ACC GAA GGC 
Val Thr Glu Gly 
45 



AGC CAG TTG 
Ser Gin Leu 



GAC ACC' 
Asp Thr 



TCG GAA 
er Glu 



GTC CGC GGC 
Val Arg Gly 



GAG CAG 
Glu Gin 



TTG CTG CAA CAG 
Leu Leu Gin Gin 
60 



GTG GAG GAA 
Val Glu Glu 
65 



ATC GTC i 
He Val . 



CTC 
Leu 



AGC GGC CAG 
Ser Gly Gin 



CCC AAG 
Pro Lys 
75 



GTC AAC CTG ATC 
Val Asn Leu lie 



GGC CAC AGC 
Gly His Ser 



CAC GGC 
His Gly 



GGG \CG 
Gly 
85 



ACC ATC CGC 
Thr He Arg 



TAC GTC 
Tyr Val 
90 



GCC GTA CGT 
Ala Val Arg 



CCC GAC CTG 
Pro Asp Leu 



ATC GCT 
He Ala 
100 



TCC GCC 
Ser Ala 



ACC AGC GTC 
(Thr Ser Val 
105 



GGC GCC 
Gly Ala 



CAC AAG GGT 
His Lys Gly 
110 



TCG GAC ACC 
Ser Asp Thr 



GCC GAC 
Ala Asp 
115 



TTC CTG i 
Phe Leu . 



f CAG ATC 
\Gln He 

r 120 



CCA CCG 
Pro Pro 



TCG GCC GGC 
Ser Ala Gly 
125 



GAG GCA 
Glu Ala 



CTC TCC 
Leu Ser 



GGG CTG 
Gly Leu 



CTC GGC 
Leu Gly . 



CTG ATC AGC 
Leu He Ser 



TTC CTT 
Phe Leu 
145 



AGC GGC 
Ser Gly 



GGC ACC 
Gly Thr 
150 



GGT ACA CAG 
Gly Thr\Gln 



AAT TTA 
Asn Leu 
155 



GGC TCG CTG 639 
Gly Ser Leu 



GAG TCG 
Glu Ser 
160 



CTG 
Leu 



AAC AGC 
Asn Ser 



GAG GGT 
Glu Gly 
165 



GCC GCG C($C 
Ala Ala Ai 



TTC AAC 
Phe Asn 
170 



GCC AAG 
Ala Lys 



TAC CCG 
Tyr Pro 
175 



CAG GGC 
Gin Gly 



ATC 
He 



CCC ACC 
Pro Thr 
180 



TCG GCC 
Ser Ala 



TGC GGC GAA \ 
Cys Gly Glu 
185 



JGC GCC 
;iy Ala 



TAC AAG 
Tyr Lys 



GTC AAC 
Val Asn 
190 



GGC GTG AGC 
Gly Val Ser 



TAT TAC 
Tyr Tyr 
195 



TCC TGG 
Ser Trp 



AGC GGT TCC 
Ser Gly Ser 
200 



TCE CCG 
SeA Pro 



CTG ACC 
Leu Thr 
205 



AAC TTC 
Asn Phe 



CTC GAT 
Leu Asp 



AGC GAC 
Ser Asp 



GCC TTC 
Ala Phe 



CTC GGC GCC 
Leu Gly Ala 
215 



TCG \CQ 
Ser 



CTG ACC 
Leu Thr 
220 



TTC AAG 
Phe Lys 



AAC GGC . 
Asn Gly 
225 



GCC AAC 
Ala Asn 



GAC GGC 
Asp Gly 
230 



CTG GTC GGC 
Leu Val Gly 



ACC TGC 
Thr Cys\ 
235 



AGT TCG 
Ser Ser 



CAC CTG 
His Leu 



GGC ATG 
Gly Met 
240 



GTG 
Val 



ATC CGC 
He Arg 



GAC AAC 
Asp Asn 
245 



TAC CGG ATG 
Tyr Arg Met 



AAC CAC 
Asn His 
250 



ITG GAC GAG GTG 927 
Lteu Asp Glu Val 
255 



AAC CAG 
Asn Gin 



GTC 
Val 



TTC GGC 
Phe Gly 
260 



CTC ACC 
Leu Thr 



AGC CTG TTC 
Ser Leu Phe 
265 



GAG ACC 
Glu Thr 



AGC\ CCG GTC AGC 
Ser\Pro Val Ser 
270 
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GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 
Val Tyr Arg Gin His Al(a Asn Arg Leu Lys Asn Ala Ser Leu 
275 \ 280 285 

TAGGACCCCG GCCGGGGCCT C&GCCCGGGC CC 



(2) INFORMATION FOR SEQ lb NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 311 abino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protkin 

(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 12: 

Met Lys Lys Lys Ser Leu Leu Pro Leu Gly Leu Ala lie Gly Leu Ala 

-26 -25 -20 Y« -15 

Ser Leu Ala Ala Ser Pro Leu IlffTEln Ala Ser Thr Tyr Thr Gin Thr 

-10 -5 *-A 1 5 

Lys Tyr Pro lie Val Leu Ala His GVLy Met Leu Gly Phe Asp Asn lie 
10 15 20 

Leu Gly Val Asp Tyr Trp Phe Gly lift Pro Ser Ala Leu Arg Arg Asp 
25 30 \ 35 

Gly Ala Gin Val Tyr Val Thr Glu Gly Ver Gin Leu Asp Thr Ser Glu 
40 45 \ 50 

Val Arg Gly Glu Gin Leu Leu Gin Gin Vad Glu Glu lie Val Ala Leu 
55 60 \ 65 70 

Ser Gly Gin Pro Lys Val Asn Leu lie Gly\His Ser His Gly Gly Pro 
75 80 \ 85 

Thr lie Arg Tyr Val Ala Ala Val Arg Pro Atep Leu lie Ala Ser Ala 
90 95 \ 100 

Thr Ser Val Gly Ala Pro His Lys Gly Ser AsA Thr Ala Asp Phe Leu 
105 110 \ 115 

Arg Gin lie Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 
120 125 130 

Val Asn Ser Leu Gly Ala Leu lie Ser Phe Leu S\r Ser Gly Gly Thr 
135 140 145 \ 150 

Gly Thr Gin Asn Leu Leu Gly Ser Leu Glu Ser LedAsn Ser Glu Gly 
155 160 \ 165 

Ala Ala Arg Phe Asn Ala Lys Tyr Pro Gin Gly lie Pro Thr Ser Ala 
170 175 \ 180 

Cys Gly Glu Gly Ala Tyr Lys Val Asn Gly Val Ser TUr Tyr Ser Trp 
185 190 19t> 



Ser Gly Ser Ser Pro Leu Thr Asn Phe Leu Asp Pro Ser\ Asp Ala Phe 
200 205 210 
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Leu Gly Ala Ser Ser Lek Thr Phe Lys Asn Gly Thr Ala Asn Asp Gly 
215 220\ 225 230 

Leu Val Gly Thr Cys Ser ^Ber His Leu Gly Met Val lie Arg Asp Asn 
235 \ 240 245 

Tyr Arg Met Asn His Leu AsV> Glu Val Asn Gin Val Phe Gly Leu Thr 
250 \ 255 260 

Ser Leu Phe Glu Thr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 
265 \70 275 

Arg Leu Lys Asn Ala Ser Leu 
280 285 



(2) INFORMATION FOR SEQ ID NO: 

(i) SEQUENCE CHARACTERISTICS^ 

(A) LENGTH: 1050 base P^i,rs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unkn 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic^ 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 8 5. .1017 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 163. . 1017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCT& GCAGGACGCG CCCCTCGGCC 



CCATCAACCT GAGATGAGAA CAAC ATG AAG AAG AAG ' 
Met Lys Lys Lys 
-26 -25 



CTG CTC CCC CTC 
Leu Leu Pro Leu 
-20 



GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 
Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala SeV Pro Leu lie Gin 



GCC AGC ACC TAC ACC CAG ACC AAA TAC CCC ATC GTG\ CTG GCC CAC GGC 
Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro lie Val\Leu Ala His Gly 



ATG CTC GGC TTC GAC AAC ATC CTT GGG GTC GAC TAC TGG TTC GGC ATT 

Met Leu Gly Phe Asp Asn lie Leu Gly Val Asp Tyr Tvrp Phe Gly lie 
20 25 

CCC AAC GCC TTG CGC CGT GAC GGT GCC CAG GTC TAC GTt ACC GAA GGC 

Pro Asn Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Gly 



AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC GAG CAG TTGX]TG CAA CAG 
Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
50 55 60 
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GTG GAG GAA ATC GTC GC& CTC AGC GGC CAG 
Val Glu Glu He Val Ala\Leu Ser Gly Gin 
65 \ 70 

GGC CAC AGC CAC GGC GGG QCG ACC ATC CGC 
Gly His Ser His Gly Gly p\o Thr He Arg 

80 f " 

CCC GAC CTG ATC GCT TCC GCC\ACC AGC GTC 
Pro Asp Leu He Ala Ser Ala VThr Ser Val 
100 \ 105 

TCG GAC ACC GCC GAC TTC CTG dpC CAG ATC 
Ser Asp Thr Ala Asp Phe Leu Arg Gin He 
115 \ 120 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC 
Glu Ala Val Leu Ser Gly Leu Vai Asn Ser 
130 - -- - 

TTC CTT TCC AGC GGC GGC ACC GG'. 
Phe Leu Ser Ser Gly Gly Thr G1 J 
145 150 

'GAG TCG CTG AAC AGC GAG GGT GCC G< 
Glu Ser Leu Asn Ser Glu Gly Ala 
160 165 



CCC AAG 
Pro Lys 



TAC GTC 
Tyr Val 



GGC GCC 
Gly Ala 



CCA CCG 
Pro Pro 



GTC AAC CTG ATC 
Val Asn Leu He 



GCC GCC 
Ala Ala 



CCG CAC 
Pro His 



GGT TCG 
Gly Ser 
125 

GCG CTG 
Ala Leu 
140 



GTA CGT 
Val Arg 



AAG GGT 
Lys Gly 
110 

GCC GGC 
Ala Gly 



ATC AGC 
He Ser 



TCG CTG 
Ser Leu 



TAC CCG 
Tyr Pro 
175 



CAG GGC ATC CCC ACC TCG GCC TGC GGC 
Gin Gly He Pro Thr Ser Ala Cys Gly 

180 A85 

GGC GTG AGC TAT TAC TCC TGG AGC GGT TQC 

Gly Val Ser Tyr Tyr Ser Trp Ser Gly Se: 
195 

CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC 

Leu Asp Pro Ser Asp Ala Phe Leu Gly Ala 

210 215 

AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC 
Asn Gly Thr Ala Asn Asp Gly Leu Val Gly 
225 230 

GGC ATG GTG ATC CGC GAC AAC TAC CGG ATG 
Gly Met Val He Arg Asp Asn Tyr Arg Met 
240 245 

AAC CAG GTC CTC GGC CTC ACC AGC CTG TTC 
Asn Gin Val Leu Gly Leu Thr Ser Leu Phe 
260 265 

GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG 
Val Tyr Arg Gin His Ala Asn Arg Leu Lys 
275 280 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CCG 




AGC CCG GTC AGC 
Ser Pro Val Ser 
270 

AGC CTG 
Ser Leu 
285 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 311 amino acids 

(B) TYPE\ amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TMPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Lys Lys Lys Ser ]\eu Leu Pro Leu Gly Leu Ala He Gly Leu Ala 
-26 -25 \ -20 -15 



Ser Leu Ala Ala Ser 
-10 



Prb Leu He Gin Ala Ser Thr 



Tyr Thr Gin Thr 
5 



Lys Tyr Pro He Val Leu Via His Gly Met Leu Gly 

15 



Phe Asp Asn He 
20 

Leu Arg Arg Asp 
35 

Gly Ala Gin Val Tyr Val Th^Glu Gly Ser Gin Leu Asp Thr Ser Glu 



Leu Gly Val Asp Tyr Trp Pfte Gly He Pro Asn Ala 
30 



Val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu Glu 

55 60 \ 65 

Ser Gly Gin Pro Lys Val Asn Le\ lie Gly His Ser 
75 

Thr He Arg Tyr Val Ala Ala Val Vrg Pro Asp Leu 
90 

Thr Ser Val Gly Ala Pro His Lys G\y Ser Asp Thr 

105 110 

Arg Gin He Pro Pro Gly Ser Ala Gly\Glu Ala Val Leu Ser Gly Leu 

120 125 \ 130 



He Val Ala Leu 
70 



His Gly Gly Pro 
85 



lie Ala Ser Ala 
100 



Ala Asp Phe Leu 
115 



Val Asn Ser Leu Gly Ala Leu He Ser Phe Leu Ser 
135 140 \ 145 



Gly Thr Gin Asn Leu Leu Gly Ser Leu GlVi Ser Leu 
155 " 16C 



Ala Ala Arg Phe Asn Ala Lys Tyr Pro GlnXGly He 
170 175 



Cys Gly Glu Gly Ala Tyr Lys Val Asn Gly Vial Ser 
185 190 



Ser Gly Gly Thr 
150 



Asn Ser Glu Gly 
165 



Pro Thr Ser Ala 
180 



Tyr Tyr Ser Trp 
195 



Ser Gly Ser Ser Pro Leu Thr Asn Phe Leu Asb Pro Ser Asp Ala Phe 
200 205 \ 210 



Leu Gly Ala Ser Ser Leu Thr Phe Lys Asn Gly\Thr 
215 220 225 \ 



Leu Val Gly Thr Cys Ser Ser His Leu Gly Met \al 
235 240 



Ala Asn Asp Gly 
230 



He Arg Asp Asn 
245 
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Tyr Arg Met Ash. His Leu Asp Glu Val Asn Gin Val Leu Gly Leu Thr 
250. 255 260 

Ser Leu Phe Glu Vhr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 
265 \ 270 275 

Arg Leu Lys Asn Al\ Ser Leu 
280 \ 285 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lY)49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNEbS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS^ 

(B) LOCATION : 85. . 1 

(ix) FEATURE: 

(A) NAME /KEY : mat_pefotide 

(B) LOCATION: 163. . 10W 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GGATCCCCCG GTTCTCCCGG AAGGATTCGG (SCGATGGCTG GCAGGACGCG CCCCTCGGCC 

CCATCAACCT GAGATGAGAA CAAC ATG AAG\AAG AAG TCT CTG CTC CCC CTC 
Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 \ ' -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GtT GCC AGC CCT CTG ATC CAG 
Gly Leu Ala lie Gly Leu Ala Ser Leu AJa Ala Ser Pro Leu lie Gin 
-15 ~ -10 \ -5 

GCC AGC ACC TAC ACC CAG ACC AAA TAC CCcWc GTG CTG GCC CAC GGC 
Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro Vie Val Leu Ala His Gly 
1 5 UO 15 

ATG CTC GGC TTC GAC AAC ATC CTT GGG GTC GAC TAC TGG TTC GGC ATT 
Met Leu Gly Phe Asp Asn lie Leu Gly Val Asto Tyr Trp Phe Gly lie 
20 25 \ 30 

CCC AGC GCC TTG CGC CGT GAC GGT GCC CAG GTc\ TAC GTC ACC GAA GGC 
Pro Ser Ala Leu Arg Arg Asp Gly Ala Gin Val\Tyr Val Thr Glu Gly 
35 40 \ 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC GAG (3AG TTG CTG CAA CAG 
Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu GVLn Leu Leu Gin Gin 
50 " 55 \ 60 



GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG CCC AAG 
Val Glu Glu He Val Ala Leu Ser Gly Gin Pro Ly3 
65 70 



GTC AAC CTG ATC 
Val Asn Leu He 



GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC 
Gly His Ser Hife Gly Gly Pro Thr lie Arg 



CCC GAC CTG ATC GCT TCC GCC ACC AGC GTC 
Pro Asp Leu lie AAa Ser Ala Thr Ser Val 

105 

TCG GAC ACC GCC GACYtTC CTG CGC CAG ATC 
Ser Asp Thr Ala Asp Vhe Leu Arg Gin lie 
120 

GAG GCA GTC CTC TCC GGfe CTG GTC AAC AGC < 
Glu Ala Val Leu Ser Gly\ Leu Val Asn Ser : 



TTC CTT TCC AGC GGC GGC 

Phe Leu Ser Ser Gly Gly 
145 

GAG TCG CTG AAC AGT GAG GG' 

Glu Ser Leu Asn Ser Glu Gl- 

160 165 

CAG GGC ATC CCC ACC TCG GCC TGfc GGC GAA < 

Gin Gly lie Pro Thr Ser Ala CysXdy Glu > 
180 \ 185 

GGC GTG AGC TAT TAC TCC TGG AGC 
Gly Val Ser Tyr Tyr Ser Trp Ser 
195 

CTC GAT CCG AGC GAC GCC TTC CTC GGC\GCC 1 
Leu Asp Pro Ser Asp Ala Phe Leu Gly Via 
210 * 215 " " 

AAC GGC ACC GCC AAC GAC GGC CTG GTC G( 
Asn Gly Thr Ala Asn Asp Gly Leu Val Gl 
225 230 

GGC ATG GTG ATC CGC GAC AAC TAC CGG ATG ' 
Gly Met Val lie Arg Asp Asn Tyr Arg Met . 
240 245 

AAC CAG GTC CTC GGC CTC ACC AGC CTG TTC 
Asn Gin Val Leu Gly Leu Thr Ser Leu Phe ' 
260 265 

GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG . 
Val Tyr Arg Gin His Ala Asn Arg Leu Lys . 

275 280 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CC 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



TAC GTC 
Tyr Val 
90 

GGC GCC 
Gly Ala 



CCA CCG 
Pro Pro 



GCC GCC 
Ala Ala 



CCG CAC 
Pro His 



GGT TCG 
Gly Ser 
125 



GTA CGT 
Val Arg 
95 

AGG GGT 
Arg Gly 
110 

GCC GGC 
Ala Gly 



ATC AGC 
He Ser 




CAC CTG 
His Leu 



AGC CCG GTC AGC 
Ser Pro Val Ser 
270 
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(ii) MOLECl 
(xi) SEQUENT 



,E TYPE: protein 

DESCRIPTION: SEQ ID NO: 16: 



Met Lys Lys Lys 
-26 -25 



ler Leu Leu Pro Leu Gly Leu Ala lie Gly Leu Ala 
-20 -15 



Ser Leu Ala Ala 
-10 



SeA Pro Leu lie Gin Ala Ser Thr Tyr 
1 



Lys Tyr Pro lie Val Lteu Ala His Gly Met Leu Gly Phe 



Thr Gin Thr 
5 



Asp Asn lie 
20 



Leu Gly Val Asp 
25 



Tyr TrpXPhe Gly lie Pro Ser Ala Leu Arg Arg Asp 
30 35 



Gly Ala Gin Val Tyr Val Thr Glu Gly Ser Gin Leu Asp Thr Ser Glu 
40 A ~ 

Val Arg Gly Glu 
55 

Ser Gly Gin Pro Lys Val Asr^eV He Gly His Ser His 



Gin Leu LeuV31n Gin Val Glu Glu He Val Ala Leu 
60 



Gly Gly Pro 
85 



Thr He Arg Tyr Val Ala Ala Val \rg Pro Asp Leu He 
90 



Thr Ser Val Gly 
105 



Arg Gin He Pro 
120 



Ala Pro His Arg Gly Ser Asp Thr Ala 
110 \ 115 



Pro Gly Ser Ala Gly\Glu Ala Val Leu 
125 \ 130 



Ala Ser Ala 
100 



Asp Phe Leu 
Ser Gly Leu 



Val Asn Ser Leu 
135 



Gly Ala Leu He Ser Pbe Leu Ser Ser 
140 \ 145 



Gly Thr Gin Asn Leu Leu Gly Ser Leu Glu\ Ser Leu Asn 
155 160> 



Ala Ala Arg Phe 
170 



Cys Gly Glu Gly 
185 



Ser Gly Ser Ser 
200 



Leu Gly Ala Ser 
215 



Asn Ala Lys Tyr Pro Gin ffily He Pro 
175 



Ala Tyr Lys Val Asn Gly Va\l Ser Tyr 
190 \ 195 



Pro Leu Thr Asn Phe Leu Asp\ Pro Ser 
205 \210 



Ser Leu Thr Phe Lys Asn Gly Thr Ala 
220 225 



Leu Val Gly Thr Cys Ser Ser His Leu Gly Met Val He 
235 240 



Tyr Arg Met Asn 
250 



Ser Leu Phe Glu 
265 



Arg Leu Lys Asn 
280 



His Leu Asp Glu Val Asn Gin Va]\ Leu 
255 



Gly Gly Thr 
150 



Ser Glu Gly 
165 



Thr Ser Ala 
180 



Tyr Ser Trp 
Asp Ala Phe 



Asn Asp Gly 
230 



Arg Asp Asn 
245 



Gly Leu Thr 
260 



Thr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 
270 115 



Ala Ser Leu 
285 



- 58 - 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TY^E: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KE? 

(B) LOCATION 



CDS 
85. .1017 



(ix) FEATURE: 

(A) NAME/KEY : \mat_peptide 

(B) LOCATION: 1^3. . 1017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGATCCCCCG GTTCTCCCGG AAGGAXTTSGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 



CCATCAACCT GAGATGAGAA CAAC 



; AAG AAG AAG TCT CTG CTC CCC CTC 
Lys Lys Lys Ser Leu Leu Pro Leu 
-20 



60 
111 



GGC CTG GCC ATC GGT CTC GCC TCT\ CTC GCT GCC AGC CCT CTG ATC CAG 
Gly Leu Ala lie Gly Leu Ala Ser\Leu Ala Ala Ser Pro Leu lie Gin 
-15 -10 \ 

GCC AGC ACC TAC ACC CAG ACC AAA T^AC CCC ATC GTG CTG GCC CAC GGC 
Ala Ser Thr Tyr Thr Gin Thr Lys tW Pro lie Val Leu Ala His Gly 

10 15 

ATG CTC GGC TTC GAC AAC ATC CTT GGG\GTC GAC TAC TGG TTC GGC ATT 
Met Leu Gly Phe Asp Asn lie Leu GlyWal Asp Tyr Trp Phe Gly lie 
20 \25 30 

CCC AGC GCC TTG CGC CGT GAC GGT GCC CAG GTC TAC GTC ACC GAA GGC 
Pro Ser Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Gly 
35 40 \ 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC\GAG CAG TTG CTG CAA CAG 
Ser Gin Leu Asp Thr Ser Glu Val Arg Gly ffilu Gin Leu Leu Gin Gin 
50 55 \ 60 

GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG C<3C AAG GTC AAC CTG ATC 
Val Glu Glu lie Val Ala Leu Ser Gly Gin Prb Lys Val Asn Leu He 
65 70 \ 75 

GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAC\ GTC GCC GCC GTA CGT 
Gly His Ser His Gly Gly Pro Thr He Arg TyrWal Ala Ala Val Arg 
80 85 90 \ 95 

CCC GAC CTG ATC GCT TCC GCC ACC AGC GTC GGC GCC CCG CAC AAG GGT 
Pro Asp Leu He Ala Ser Ala Thr Ser Val Gly Ala Pro His Lys Gly 
100 105 \ 110 



TCG GAC ACC GCQ GAC TTC CTG CGC CAG ATC 
Ser Asp Thr Ala\Asp Phe Leu Arg Gin lie 
115 \ 120 



CCA CCG 
Pro Pro 



GGT TCG GCC 
Gly Ser Ala 
125 



GGC 
Gly 



GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC 
Glu Ala Val Leu SeV Gly Leu Val Asn Ser 
130 \ " 135 



CTC GGC 
Leu Gly 



GCG CTG ATC 
Ala Leu He 
140 



AGC 

Ser _ 



TTC CTT TCC AGC GGC \ GGC ATC GGT ACG CAG 
Phe Leu Ser Ser Gly Cly He Gly Thr Gin 
145 \ 150 



AAT TTT 
Asn Phe 
155 



CTG GGC TCG 
Leu Gly Ser 



CTG 
Leu 



GAG TCG CTG AAC AGC GAG GGT GCC GCG CGC 
Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg 
160 165\ 



TTC AAC 
Phe Asn 
170 



GCC AAG TAC 
Ala Lys Tyr 



CCG 
Pro 
175 



CAG GGC ATC CCC ACC TCG GCC TGC GGC GAA 
Gin Gly He Pro Thr Ser Ala Cys Gly Glu 
180 \fl 185 



GGC GCC 
Gly Ala 



TAC AAG GTC 
Tyr Lys Val 



AAC 
Asn 



GGC GTG AGC TAT TAC TCC TGGN^GC GGT TCC 
Gly Val Ser Tyr Tyr Ser Trp^er Gly Ser 
195 \ 200 



TCG CCG 
Ser Pro 



CTG ACC AAC 
Leu Thr Asn 
205 



TTC 
Phe 



CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC 
Leu Asp Pro Ser Asp Ala Phe Lea Gly Ala 
210 " 215\ 



TCG TCG 
Ser Ser 



CTG ACC TTC 
Leu Thr Phe 
220 



AAG 
Lys 



AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC 
Asn Gly Thr Ala Asn Asp Gly Leu vfel Gly 
225 230 



ACC TGC 
Thr Cys 
235 



AGT TCG CAC 
Ser Ser His 



CTG 879 
Leu 



GGC ATG GTG ATC CGC GAC AAC TAC CGG\ ATG 
Gly Met Val lie Arg Asp Asn Tyr Arg Vet 
240 245 



AAC CAC 
Asn His 
250 



CTG GAC GAG 
Leu Asp Glu 



GTG 
Val 
255 



AAC CAG GTC TTC GGC CTC ACC AGC CTG TT\C 
Asn Gin Val Phe Gly Leu Thr Ser Leu Ph\ 
260 



GAG ACC 
Glu Thr 



AGC CCG GTC 
Ser Pro Val 
270 



AGC 
Ser 



GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG < 
Val Tyr Arg Gin His Ala Asn Arg Leu Lys 
275 280 



\C GCC 
ksn Ala 



AGC CTG 
Ser Leu 
285 



TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CC 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



Met Lys Lys Lys Ser Leu Leu Pro Leu Gly Leu Ala lie Gly Leu Ala 
-26 -25 -20 " -1" 



Ser Leu Ala Ala Ser Pro Leu He Gin Ala Ser Thr Tyr Thr Gin Thr 
-10 -5 1 ' 5 



Lys Tyr Pro lie Val Leu 



Ala His Gly Met 
15 



Leu Gly Val Asp\ 
25 

Gly Ala Gin Val 
40 

Val Arg Gly Glu 
55 

Ser Gly Gin Pro 



Phe Gly He Pro 
30 



Tyr Trp 

r Val Thr Glu Gly Ser 
45 

Glr\ Leu Leu Gin Gin Val 



60 



Lys Val 
75 



Asn Leu lie Gly 
80 



Thr He Arg Tyr 
90 



Thr Ser Val Gly . 
105 



Arg Gin He Pro 
120 



\Ala Val Arg Pro 
\ 95 

H\s/]Lys Gly Ser 



Leu Gly Phe 

Ser Ala Leu 
35 

Gin Leu Asp 
50 

Glu Glu He 
65 

His Ser His 
Asp Leu He 



Asp Thr Ala 
115 



Ala Val Leu 
130 



Val Asn Ser Leu 
135 



Gly Thr Gin Asn 



Gly Ala 
140 



Phe Leu 
155 



Leu lie Ser Phe 



Gly Seft Leu Glu 
160 



Leu Ser Ser 
145 



Ser Leu Asn 



Ala Ala Arg Phe 
170 



Lys Tyr Pro Gin Gly He Pro 
1\75 



Cys Gly Glu Gly Ala Tyr 
185 



Lys Val Ask Gly 
190 



Ser Gly Ser Ser 
200 



Leu Gly Ala Ser 
215 



Thr Asn Phe I^eu 
205 



Ser Leu 
220 



Thr Phe Lys Asft 



Val Ser Tyr 
195 



Asp Pro Ser 
210 



Gly Thr Ala 
225 



Leu Val Gly Thr 



Cys Ser 
235 



Ser His Leu Gly 
240 



let Val He 



Tyr Arg Met Asn 
250 



Ser Leu Phe Glu 
265 



Arg Leu Lys Asn 
280 



Asp Glu Val Asn 
255 



Pro Val Ser Val 
270 



GAn Val Phe 



Asp Asn He 
20 

Arg Arg Asp 

Thr Ser Glu 

Val Ala Leu 
70 

Gly Gly Pro 
85 

Ala Ser Ala 
100 

Asp Phe Leu 

Ser Gly Leu 

Gly Gly He 
150 

Ser Glu Gly 
165 

Thr Ser Ala 
180 

Tyr Ser Trp 
Asp Ala Phe 



Asn Asp Gly 

230 



Arg Asp Asn 
245 



Gly Leu Thr 
260 



Tyr\Arg Gin 
* 275 



His Ala Asn 



Leu 
285 



(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 



(D) TOPOLOGY: linear 

(ii) MOLECULE other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GCGCAATTAA CCCTCACTAa\ AGGGAACAAA 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25\bsflse pairs 

(B) TYPE: nucleYgacid 

(C) STRANDEDNES|fc? unknown 

(D) TOPOLOGY: lMear 

(ii) MOLECULE TYPE: otheY nucleic acid 

(A) DESCRIPTION: \/desc = "synthetic DNA" 

(xi) SEQUENCE DESCRIPTION^ SEQ ID NO: 20: 

GGTACGCAGA ATNNNCTGGG CTCGC 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS :\ 

(A) LENGTH: 27 base pairs, 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown \ 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic\acid 

(A) DESCRIPTION: /desc = \synthetic DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID \{0: 21: 

GCGTAATACG ACTCACTATA GGGCGAA 



